gh-149449: Fix use-after-free in _PyUnicode_GetNameCAPI#150323
Merged
Conversation
Contributor
|
I think the simpler alternative to this is to just statically allocate |
The _PyUnicode_Name_CAPI struct was malloc'd per import and freed by
the capsule destructor, leaving the per-interpreter cached pointer
dangling once unicodedata was removed from sys.modules and gc'd. The
\N{...} parser path and the namereplace codec handler then crashed.
Allocate the struct in static storage and drop the destructor; the
contents are immutable function pointers shared across imports.
b942917 to
ca33d17
Compare
Co-authored-by: Kumar Aditya <kumaraditya@python.org>
kumaraditya303
approved these changes
May 24, 2026
|
Thanks @eendebakpt for the PR, and @kumaraditya303 for merging it 🌮🎉.. I'm working now to backport this PR to: 3.13, 3.14, 3.15. |
|
GH-150352 is a backport of this pull request to the 3.15 branch. |
|
GH-150353 is a backport of this pull request to the 3.14 branch. |
|
GH-150354 is a backport of this pull request to the 3.13 branch. |
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
The cache stored the raw struct pointer extracted from unicodedata's _ucnhash_CAPI capsule without keeping a reference to the owning capsule. Dropping the module from sys.modules and running gc.collect() then freed the struct while \N{...} decoding and the namereplace codec handler were still using it. Hold a strong reference to the capsule on the interpreter state and Py_CLEAR it in _PyUnicode_Fini.
This needs backports I think, as it also affects python versions 3.10 to 3.15.
Claude was used to construct a PR.